Picture for Wenqiao Zhang

Wenqiao Zhang

Learning to Adapt: Self-Improving Web Agent via Cognitive-Aware Exploration

Add code
May 29, 2026
Viaarxiv icon

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

Add code
May 28, 2026
Viaarxiv icon

MAIGO: Mitigating Lost-in-Conversation with History-Cleaned On-Policy Self-Distillation

Add code
May 26, 2026
Viaarxiv icon

InstructSAM: Segment Any Instance with Any Instructions

Add code
May 25, 2026
Viaarxiv icon

Regulating Anatomy-Aware Rewards via Trajectory-Integral Feedback for Volumetric Computed Tomography Analysis

Add code
May 19, 2026
Viaarxiv icon

EgoCoT-Bench: Benchmarking Grounded and Verifiable Operation-Centric Chain of Thought Reasoning for MLLMs

Add code
May 19, 2026
Viaarxiv icon

CrossView Suite: Harnessing Cross-view Spatial Intelligence of MLLMs with Dataset, Model and Benchmark

Add code
May 18, 2026
Viaarxiv icon

Catching Every Ripple: Enhanced Anomaly Awareness via Dynamic Concept Adaptation

Add code
Apr 16, 2026
Viaarxiv icon

IAD-Unify: A Region-Grounded Unified Model for Industrial Anomaly Segmentation, Understanding, and Generation

Add code
Apr 14, 2026
Viaarxiv icon

LMMs Meet Object-Centric Vision: Understanding, Segmentation, Editing and Generation

Add code
Apr 13, 2026
Viaarxiv icon